Word | Frequency | Number of right neighbors | Number of left neighbors | Ratio |
---|---|---|---|---|
který | 11431 | 419 | 1 | 419.0000 |
Po | 5995 | 309 | 1 | 309.0000 |
Roku | 2141 | 257 | 1 | 257.0000 |
Je | 4332 | 205 | 1 | 205.0000 |
Ve | 3330 | 180 | 1 | 180.0000 |
Při | 1879 | 145 | 1 | 145.0000 |
která | 8161 | 277 | 2 | 138.5000 |
Podle | 1827 | 113 | 1 | 113.0000 |
Jeho | 3287 | 220 | 2 | 110.0000 |
Během | 1547 | 101 | 1 | 101.0000 |
Od | 2864 | 100 | 1 | 100.0000 |
Tyto | 1073 | 90 | 1 | 90.0000 |
Jako | 1296 | 71 | 1 | 71.0000 |
Tato | 1707 | 136 | 2 | 68.0000 |
Z | 2566 | 136 | 2 | 68.0000 |
Další | 1350 | 65 | 1 | 65.0000 |
neboť | 1225 | 54 | 1 | 54.0000 |
avšak | 899 | 53 | 1 | 53.0000 |
Pro | 1733 | 102 | 2 | 51.0000 |
Obec | 2095 | 50 | 1 | 50.0000 |
Word | Frequency | Number of right neighbors | Number of left neighbors | Ratio |
---|---|---|---|---|
př | 1119 | 1 | 90 | 0.0111 |
tzv | 2916 | 1 | 76 | 0.0132 |
sv | 1114 | 1 | 60 | 0.0167 |
např | 3397 | 1 | 57 | 0.0175 |
č | 847 | 1 | 43 | 0.0233 |
km. | 494 | 1 | 38 | 0.0263 |
II | 1669 | 4 | 139 | 0.0288 |
III | 590 | 2 | 52 | 0.0385 |
m. | 1134 | 3 | 72 | 0.0417 |
stol | 215 | 1 | 20 | 0.0500 |
cm. | 238 | 1 | 20 | 0.0500 |
s. | 4479 | 6 | 107 | 0.0561 |
IV | 544 | 2 | 32 | 0.0625 |
Panny | 216 | 1 | 15 | 0.0667 |
mm. | 225 | 1 | 15 | 0.0667 |
vzniknout | 101 | 1 | 15 | 0.0667 |
VI | 170 | 1 | 15 | 0.0667 |
tvořit | 117 | 1 | 14 | 0.0714 |
přednost | 142 | 1 | 13 | 0.0769 |
považována | 306 | 1 | 13 | 0.0769 |
In this subsection, we compute the ratio of the number of right neighbors and the number of left neighbors. Again, we look for words with extreme ratios:
Data for first table:
select word,w.freq,aa.cnt, bb.cnt,aa.cnt/bb.cnt as r from words w, (select w1_id,count(c.w2_id) as cnt from co_n c where w1_id>100 group by w1_id) aa, (select w2_id,count(c.w1_id) as cnt from co_n c where w2_id>100 group by w2_id) bb where w_id=aa.w1_id and aa.w1_id=bb.w2_id order by r desc limit 20;
Diagram data:
select aa.cnt, bb.cnt from (select w1_id,count(c.w2_id) as cnt from co_n c where w1_id>100 group by w1_id) aa, (select w2_id,count(c.w1_id) as cnt from co_n c where w2_id>100 group by w2_id) bb where aa.w1_id=bb.w2_id;
5.1.7.1 Number of NN co-occurrences vs. Frequency I
5.1.7.2 Number of NN co-occurrences vs. Frequency II